Toward Discriminative Learning of Planning Heuristics

نویسندگان

  • Yuehua Xu
  • Alan Fern
چکیده

We consider the problem of learning heuristics for controlling forward state-space search in AI planning domain. We draw on a recent framework for “structured output classification” (e.g. syntactic parsing) known as learning as search optimization (LaSO). The LaSO approach uses discriminative learning to optimize heuristic functions for search-based computation of structured outputs and has shown promising results in a number of domains. However, the search problems that arise in AI planning tend to be qualitatively very different from those considered in structured classification, which raises a number of potential difficulties in directly applying LaSO to planning. In this paper, we discuss these issues and describe a LaSO-based approach for discriminative learning of beamsearch heuristics in AI planning domains. Our preliminary results in three benchmark domains are promising. In particular, across a range of beam-sizes the discriminatively trained heuristic outperforms the one used by the planner FF and another recent non-discriminative learning approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discriminative Learning of Beam-Search Heuristics for Planning

We consider the problem of learning heuristics for controlling forward state-space beam search in AI planning domains. We draw on a recent framework for “structured output classification” (e.g. syntactic parsing) known as learning as search optimization (LaSO). The LaSO approach uses discriminative learning to optimize heuristic functions for search-based computation of structured outputs and h...

متن کامل

The Nature of Heuristics II : Background and Examples

Machine learning can be categorized along many dimensions, an important one of which is `degree o f human guidance or forethought' . This continuum stretches from rote learning, through carefully guided concept-formation by observation, out toward independent theory formation . Six years ago, the AM program was constructed as an experiment in this latter kind of learning by discovery . Its sour...

متن کامل

Stochastic margin-based structure learning of Bayesian network classifiers

The margin criterion for parameter learning in graphical models gained significant impact over the last years. We use the maximum margin score for discriminatively optimizing the structure of Bayesian network classifiers. Furthermore, greedy hill-climbing and simulated annealing search heuristics are applied to determine the classifier structures. In the experiments, we demonstrate the advantag...

متن کامل

Efficient Heuristics for Discriminative Structure Learning of Bayesian Network Classifiers

We introduce a simple order-based greedy heuristic for learning discriminative structure within generative Bayesian network classifiers. We propose two methods for establishing an order of N features. They are based on the conditional mutual information and classification rate (i.e., risk), respectively. Given an ordering, we can find a discriminative structure with O ( Nk+1 ) score evaluations...

متن کامل

Learning to Rank for Synthesizing Planning Heuristics

We investigate learning heuristics for domainspecific planning. Prior work framed learning a heuristic as an ordinary regression problem. However, in a greedy best-first search, the ordering of states induced by a heuristic is more indicative of the resulting planner’s performance than mean squared error. Thus, we instead frame learning a heuristic as a learning to rank problem which we solve u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006